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Abstract 

A new Bayesian significance test is adjusted for jump detection in a 
diffusion process. Tliis is an advantageous procedure for temporal data 
having extreme valued outliers, like financial data, pluvial or tectonic forces 
records and others. 
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g : 1 Introduction 
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In the context of financial data some variables often present strong discontinuities, 
the so-called jumps. The well-known stochastic differential equation driven by a 
Brownian motion of Black and Scholes [T] may then becomes unsuitable. 

Decision-making in the presence of jumps has been recently considered, in both 
theoretical and empirical work. Merton [TT] has proposed diffusion models with 
jumps where the logarithm of jump sizes is assumed to be Gaussian. Kou [7] has 
suggested a double exponential law for that variable and a more general case, with 
the power exponential distribution, has been considered by Galea, Ma and Torres 

a- 

Several inferential techniques have been developed in this area. Lee and Mykland 
[B] present a nonparametric approach, Chan [2J suggests maximum likelihood es- 
timation. Continuous time models face however a difficulty in detecting jumps, as 
available data are obviously discrete. 
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On the other hand, diffusion processes with jumps are inherently non-identifiable 
models: trajectories are sums of diffusion and jump processes. The usual method- 
ology of classical statistical inference then become inappropriate (Khalof, Saphores 
and Bilodeau [6]; Luan and Xie [9]). Finally, asymptotic sampling distributions 
are particularly inadequate for small samples in this context. 

This paper proposes a full Bayesian inference approach for the problem. The Full 
Bayesian Significance Test (FBST) was developed by Pereira and Stern [12] for 
sharp hypothesis testing in parametric models. The FBST is also the optimal 
solution for the considered decision problem, as shown by Madruga, Esteves and 
Wechsler |T0] , who obtained well-defined loss functions that make FBST a genuine 
Bayes test. Such loss functions are very useful in our context as the statistician 
may fix them to particular numerical descriptions of her world. A deep analysis 
and revision of FBST may be found in Pereira, Stern and Wechsler [IB] . 

This paper considers diffusion models with jumps driven by the following stochastic 
differential equation 



In the equation above, Wt is a standard Brownian motion, A^^ is a Poisson Process, 
and Vi are non-negative independent identically distributed random variables. The 
derivative fi represents expected return, and a represents the volatility. 

By making time discrete with unit steps, the equation above can be approximated 
by the difference equation 



B, Z, and X are independent random variables, B and Z having a Bernoulli and 
a standard Normal law, respectively. 

A natural parameterization of the previous convolution allows us to frame the 
problem of jump detection as a test of hypothesis. In this test, the null hypothesis 
of no jumps is a sharp hypothesis. 

Section 2 describes the formulation of the diffusion model with jumps and its 
discrete version. Section 3 presents the Full Bayesian Significance Test. The 
parameterization of the model and the application of FBST to it are seen in Section 
4. Section 5 has numerical results for both real and simulated data, yielding also 
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H + aZ + B ■ X. 
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parameter estimates of maximum posterior density. Section 6 discusses possible 
generalizations and presents conclusions. 



2 The jump-diffusion model formulation 

In this section we state the jump-diffusion model that motivated the statistics test 
that we are interested in. Let (fi, JF, P, {JF^}) be a completed filtered probability 
space on which is defined a Brownian motion W and a compound Poisson process 
J, both adapted to the filtration {J-t}- More precisely, we assume that the process 
J takes the following form: 



where N — {Nt} is a standard Poisson process with rate A, and {^ jis a sequence 
of i.i.d. nonnegative random variables. We assume that: 

1. for each j, Xj = log{Vj) has a given distribution; 

2. the process W, N, and X/s are independent; 

3. J^t — -/s : < s < i}, i > 0, augmented under P so that it satisfies the 
usual hypothesis. 

In our jump-diffusion model we assume that all economics have a finite horizon 
[0, T] , and the price of our underlying risky asset is given by the following stochastic 
differential equation: 



For notational simplicity and in order to get analytical solutions, the drift // and 
the volatility a are assumed to be constants, and the Brownian motion and jumps 
are assumed to be one dimensional. These assumptions, however, can be omitted 
to develop a general theory. 



Jt^Y^i^j-i), t>o, 



(1) 




(2) 
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2.1 Discrete Model 



The goal of this section is to approximate the equation given in ([2]) using the 
Euler method. We know that, from Protter [14j, the solution to the Stochastic 
Differential Equation ([2]), that give us the dynamics of the asset price, is given by 



S. = S„e(''--)'**nV.. (3) 

i=l 



Next, 



AS'f _ St+i — St 

S+ Si 



t 

= exp\^(^^-'^^/\t + a{Wt+^t-Wt)+ ^''x.j-l. (4) 

If At is small enough, we can reject the terms of greatest order from the Taylor 
expansion, approximating e^' by 1 + a; + /2. We obtain 

AC / Nt+At 1 

V ^ / i=Nt+l ^ 



Nt+At 
i=Nt+l 

where Z is a normal standard random variable and the unknown parameters are 
/i, which represents the expected return, a, the volatility, and A, the jump rate. 

As it was shown in Kou [7j, for Aj small enough we have: 



Nt 



+ At 



^Nt+A, w.p. AA; 



X,= ^-\^ (6) 



i=Nt+l 



0, w.p. 1 - AA. 



In other words, if 6 = \tc\ is sufficiently small, the return can be approximated in 
distribution by 

AC 

—^ = fi6 + aZV6 + B-X, (7) 

where i? is a Bernoulli random variable with P{B = 1) = X6 and P{B = 0) = 
1 - \5, and Z ~ A^(0, 1). Note that 
P{aV5Z + BX <x) = 

= P{aV6Z + X < x)P{B = 1) + P{(xV6Z < x)P{B = 0) 

= P{aV6Z + X < x)X6 + P{aV6Z < x){l- X6). (8) 
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So, returns can be modeled as the convolution (7\fbZ \B between independent 
normal random variable and a random variable with some distribution F . 

In this work, we will assume that 5=1 and random variable X has Bernoulli 
distribution with unknown parameter p. Then, BX has a Bernoulli distribution 
given by 

/(x) = = ^' (9) 

[ 1 — Ap, for X = 0. 

There are many alternative models depending on the distribution of the jumps, see 
for example Merton [11] , for the normal distribution, Kou for double exponential, 
and Galea, Ma and Torres [4j for a generalization of the previous works. 

From ([HD and ([2]), the density function of /x + aZ + BX has the following repre- 
sentation: 



f{y) = (1 - Xp)e^^'^~''> + Xpe^^y^'^''+'» . (10) 

V 271(7 ^ ' 



10 20 30 4D 

Figure 1: Time plot and smoothed density of simulated data, Ap = 0.05. 

Figure [1] shows a realization of the returns ASt/St and the density (ITO!) . for At = 1 
day, fi = 0, a = 0.2, \p = 0.05. In other words, for this process, there are about 
Xp = 5% jumps per year with size one. 




3 Full Bayesian Significance Test 

Let us consider a random variable D whose value d in the measurable sample space 
(fi, 5) is to be observed. 

Let be the parametric space, that is, a set such that Pr{-\6) is a well-defined 
probability measure in S, for all ^ G 6. Denote by (9, B, vr) a probability measure 
structure on such that vr determines a priori probability distribution on 6. 
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After observing data d, the information about 9 is updated by Bayes' theorem and 
quantified by the posterior probabihty law on G, tt^. 

Full Bayesian Significance Test (FBST) procedure is defined in the case when this 
posterior distribution has a density function with respect to Lebesgue measure. 

Let f{9) denote the priori density, f{d\9)^ the likelihood function of 9 after observ- 
ing data (i, and f{9\d), the posterior density of 9 given data (i, related by 

f{9\d) (X f{9)f{d\9). 

A precise hypothesis H can be defined as a submanifold Oq C O such that 
dimlQo) < dim{Q). This implies that the posterior probability of a precise hy- 
pothesis is null for an absolutely continuous posterior distribution: every precise 
hypothesis should be rejected in that case. 

In order to avoid such a drastic conclusion, FBST deals not directly with Gq, but 
with a sort of critical region defined by the level surfaces of the posterior density 
f{9\d). 

Let us define the tangential set Tq to the null hypothesis 6o as the set 

To = {9eQ: f{9\d) > /o}, where /o = snp f{9\d). (11) 

In other words, the tangential set to Bq considers all points "most probable" than 
©0, according to the posterior law. 

The posterior probability of Tq, 

Ko= [ f{9\d)d9, (12) 
is called its credibility. The evidence for the null hypothesis is then defined as 

ev(Go) = l-ft:o = l-7rrf(To). (13) 

So, if tangential set has high posterior probability, the evidence in favor of 0o is 
small; if it has low posterior probability, the evidence against Go is small. 

In Madruga et al. [lOj, the Bayesianity of the test of significance based on this 
evidence measure is showed, in the sense that there exists a loss function such that 
the decision for rejecting the null hypothesis is based on its posterior expected 
value minimization. 

The computation of ef (Gq) can be performed in two steps: a numerical optimiza- 
tion procedure to find /o, and a numerical integration to find kq. 
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4 FBST for Bernoulli jumps 



For the problem presented in Section [21 let us consider the parametric space 
= = (A,p,/i,a^) G R'^ : A,p G [0,l],/i G i?, > 0}. 

According to equation ( fTOl) . the likelihood for the parameter ^, given a sample 
d = (xi, . . . , may be written as 

Aj9e-^("'-('^+'))' + (1 - Ap)e-5^("^-^)'" 

i=l V ^/ii^ 

= (27r(T2)-"/2e-i^S-i("'-'^)' (14) 
X X^(Ape-^)'=(l-Ap)"-'=5]e^Sti-«^ 

A;=0 

From the above equation, we note that if Xipi = X2P2, then the parameters 
(Ai,pi, /X, 0"^) and (A2, J92, /W, o"^) are observationally equivalent, as they have the 
same likelihood from data d. 

The function J : 6 ^ T, I{X,p, fi, cr^) = (Ap, /i, o"^) is an identifying function, that 
is, it is a one-to-one transformation under that equivalence relation. (See Kadane 
[5] for more properties and examples of identifying functions.) 

Theorem 5 in Kadane [5] states that the Bayesian analysis can be done on T, from 
the prior induced on T by / and the likelihood function. 

Aiming to preserve the information of an experiment about the parameters of inter- 
est, we will define a prior distribution for 9 such that f{X,p, /i, cr^) = f{X',p', /i, a^) 
whenever Xp = X' p'. 

It will be assumed that (A, p) and (yU, cr^) are independent random vectors, yielding 

/(A,p,/i,(T^) = /(A,p)/(/i,a^), 
for every (A,p, /i, o"^) G O. 

The following prior density will be adopted for (A,p) 

f{X,p\p) oc {1- Xpf-^ para 0<A,p<l, 
with known (3 > 0. 

In order to ease some numerical aspects, we will assume a uniform on (0, a^) density 
for 0"^ and, given cr^, a uniform on {fiQ—ca, /io+co") law for /i, with hyperparameters 
(Jq, c> 0, ^ R- 



me) = n 
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The posterior density for 6 = {\,p,fi,a'^) is then 



fm oc f{d\d)f{\p)f{^i,a^) 
oc f{d\d){l-Xpf~'-l 



(15) 



a 



The null hypothesis, 



eo = {(A,p,/i,(T2)Ge:Ap = o}, 



(16) 



states that the process has no jumps, a sharp hypothesis. 

Under this formulation, the measure of evidence in favor of Go, ew(Go), defined 
by f|T^ . allows us to perform a significance test for 0o without having to mod- 
ify the posterior distribution (either by assigning positive probability to the null 
hypothesis or by enlarging it, both polemical solutions in the literature). 

5 Numerical results 
5.1 Simulation 

We used S-Plus to perform FBST on simulated data from the convolution of a 
Normal(5, 0.2^) and a Bernoulli with parameter \p having the values 0, 0.025, 
0.10, 0.35 and 0.50. 

Data were standardized 



where x and Sx are, respectively, the sample mean and sample standard deviation, 
with sample size n = 40. The hyperparameters for the prior distribution were 
chosen to be /5 = 1, o"o = 10, yUo = 0, c = 10. 

Table [T] presents the evidence values ef(0o) for the distinct rates simulated, and 
also maximum posterior estimates and the mean posterior estimate for 9' = {Xp, fi, a) 

Figure [2] presents the simulated data on time for a process having Xp = 0.025 as 
jump rate. The right-hand graph shows the data empirical density (in full line), 
and the density adjusted by the posterior mode (in dotted line). Data suggest the 
possibility of jumps, quantified by the small evidence in favor of the null hypothesis. 




ev{Qo) = .00382. 
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Xp 


ev{Qo) 


posterior mode {Xp, fi, a) 


posterior mean {Xp, fj,, a 





1 


(.0000, 5.045, .2087) 


(.4077, 4.629, .2139) 


0.025 


.00382 


(.06211, 4.993, .2071) 


(.08346, 5.005, .2044) 


0.10 


3.98e-8 


(.1494, 5.002, .1510) 


(.1438, 5.011, 0.1766) 


0.35 


3.47e-5 


(.3416,5.029,. 1719) 


(.3501, 5.021, .1933) 


0.50 


3.37e-6 


(.5193, 5.046, .1826) 


(.5184, 5.049, .2046) 



Table 1: In the second column, evidence values for several jump rates, Xp, and modal 
and mean estimates for 9' at the third and fourth column, respectively. 




simulated data 



Figure 2: Time plot and fitted density for frequencies of simulated data, Xp = 0.025, 
et;(eo) = .003817. 

Graph [3] shows the simulated data empirical density (full line) in each case and 
the density determined by the posterior mode (dotted line), for the various rates 
considered. 

It can be seen, a strong bimodality in the sample is well detected by the FBST 
procedure as expected. On the other hand, without a strong bimodality, the 
evidence in favour of Oq is large, as seen in the graph of simulated data with 
Xp = 0, where ef (6o) = 1. 

The computation of evidence values ef (6o) was based on the simulation of 400,000 
independent points uniformly distributed over the support of the posterior distri- 
bution, and having their non- normalized values, 7r{6\d), computed. Other 400,000 
points were uniformly generated on Bq, in order to obtain the sample maximum 
of the non- normalized posterior density on Oq, v^o- 

The posterior probability of the tangential set can be approximated by the ratio 
of the sum of values of n{9\d) larger than ipo to the total sum of 7i{9\d) for all 
generated points. 

The mean time for running of this program was 600 seconds approximately, in a 
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4.5 5.0 5.5 6.0 

(a) 



4.5 5.0 5.5 6.0 

(b) 




(<=) (d) 



Figure 3: Simulated Data empirical density (full line), density adjusted by posterior 
mode (dotted line), with jump rate and evidence, respectively: (a) Xp = 0, ev{@o) = 1; 
(b) Xp = 0.1, eviOo) = .004662; (c) Xp = 0.35, ev{eo) = 3.47e - 5; (d) Xp = 0.5, 
eviOo) = 3.37e - 6. 

standard PC. 

Other optimization numerical methods which are more efficient for determining 
(fQ and integrating on Tq, may be used. Our simple method, however, had a good 
performance for the problem of model choice between models having Bernoulli 
jumps or models without jumps. 

Furthermore, it can be generalized in a straightforward way for other jump. 
5.2 Real data 

Figure m presents Annual Maximum Rainfall Data (Maiquetia station at Venezuela 
central coast) during 1951-1998. Source: 

http: //www.blackwellpublishing.com/rss/Volumes/ Cv52p4.htm 

Coles and Pericchi [3] use these data to compare Bayesian and Classical methods 
on the prediction of the 1999 catastrophic maximum of 410mm. 
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1950 1960 1970 1980 1990 



Figure 4: Venezuela Annual Maximum Rainfall Data, 1951 to 1998. 

Data were transformed to the standard scale, as described in the previous sec- 
tion. The jump size was considered random, so the parametric space had one 
more dimension added to it, K, having a uniform prior distribution on [0,30], 
independently from the other components. 

The analysis obtained an evidence value in favour of Gq, ef (Gq) = .03519. The 
posterior mode and mean estimates are 

(Ap, /t, a, k)mode = (0.1447, -0.2682, 0.6130, 2.227), 

(Ap, /t, (J, k)mean = (0.1683, -0.2831, 0.6576, 2.181). 

In original units, the posterior mode estimate represents the model 

Rmode = R- 0.27sd{R) + 0.6lsd{R)Z + 2.2sd{R)Bi, 

and the posterior mean estimate represents the model 

Rmean = R- 0.28sd{R) + OMsd{R)Z + 2.2sd{R)B2, 

where R and sd{R) are the sample mean and the sample standard deviation, Z 
has standard normal distribution, Bi has a Bernoulli(.145) distribution and B2, 
has a Bernoulli(.168) law. 

As seen in Figure E], that estimate fits well the observed data. 

The posterior mode estimate is here the maximum likelihood estimate, as the 
chosen prior is uniform. 

The posterior distribution simulation yields other relevant probabilities: given a 
threshold /, let At be the event that values not smaller than / are recorded only 
after time t. In the example, At indicates that an annual maximum rainfall largest 
than / takes at least t years to be recorded. 
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0mm 50mm 100mm 150mm 20 40 60 80 100 120 140 

maximum daily rainfall 

Figure 5: Real Data Empirical Density (full line), posterior mean adjusted density 
(dotted line); qqplot for data and posterior mean model. Evidence in favour of non- 
existence of jumps is ev{@Q) = 0.0351866. 

P{A\d) = [ P{At\9)f{9\d)d9 
Je 

can be approximated essentially with the same values obtained by the previous 
simulation. 

Figure [6] shows this probability as a function of time t up to 400 years, for several 
thresholds. The curves correspond to threshold of 100, 150, 165, 180 and 350mm, 
respectively from left to right, for annual maximum rainfall. 



1.0 
0.8 
0.6 
0.4 
0.2 
0.0 

100 200 300 400 

Figure 6: Survival probability until time t, for thresholds q(90) = 100, q{98) = 150, 
165, 180, 350, as a function of time t, in years. 

As an illustration, a maximum value larger than 100mm, that corresponds to the 
ninth observed decile, has a 10% chance of being recorded after 27 years, and 
a 56% chance of being observed during the next ten years. An annual maximum 
larger than 165mm, not recorded in the sample, has a 3% chance of being recorded 
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during the next 10 years, 12% during the next 50 years and 20% probabihty of 
being observed during the next 100 years. 




Figure 7: Expected time, y axis, to reach threshold I, x axis. 

FinaUy, given a threshold I, we may demand the expected time until recording a 
value not smaller than /, T^: 

' P{S>l\d)' 

Figure [7] presents Expected times T;, to reach threshold / (indicated as abscisse). 
For instance, the expected time to record an annual maximum higher than 140mm 
is around 46 years, and 77 years is the expected time until an annual maximum 
surpasses 150mm. 

6 Conclusions 

In this work we are testing two kinds of models: diffusion processes versus diffusion 
processes with Bernoulli jumps. We could also be interested in testing for more 
general jump families, which allow, for instance, heavy-tailed distributions or dif- 
fusion processes having non-constant volatility against processes having constant 
volatility. 

The proposed test procedure extends naturally for each of theses families of models, 
as long as there is a sharp hypothesis to be tested. Modifications on the likelihood 
function are straightforward. The computational cost increases with the size of 
the parameter space, as the required probabilities are integrated directly on the 
latter. 
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Noninformative prior distributions may jeopardize mixture properties of the en- 
volved Markov chains and therefore slow down the convergence rate of other Monte 
Carlo methodologies. 

We stress the absolute continuity of the posterior distribution. The chosen priors 
are usually Lebesgue absolutely continuous, entailing absolute continuity of the 
posterior for the models in our context. 

Any sharp hypothesis has therefore null posterior probability and posterior ratios 
are not a good criterion to compare models defined by sharp hypotheses. 

The measure of evidence used in this paper is associated (Section [T]) to a decision 
problem. In the real decision problem, we could - and should - consider the loss 
function the minimization of its expected value is tantamount to performing the 
FBST. 

The ideas behind FBST are intuitive and easy to understand. Moreover, non- 
identifiability does not present a problem for the Bayesian operation. 

Finally, the Bayesian approach allows us to answer questions such as those risen 
in Section 15. 2[ with real data, in terms of posterior probabilities and not limited 
to mere parametres estimates. 
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